Overview

Dataset Statistics

Number of Variables 34
Number of Rows 1.0486e+06
Missing Cells 861084
Missing Cells (%) 2.4%
Duplicate Rows 6
Duplicate Rows (%) 0.0%
Total Size in Memory 1.4 GB
Average Row Size in Memory 1.4 KB
Variable Types
  • Categorical: 26
  • Numerical: 8

Dataset Insights

1st_Road_Class has 305589 (29.14%) missing values Missing
2nd_Road_Class has 439824 (41.94%) missing values Missing
2nd_Road_Number has 10803 (1.03%) missing values Missing
LSOA_of_Accident_Location has 71890 (6.86%) missing values Missing
Weather_Conditions has 21392 (2.04%) missing values Missing
1st_Road_Number is skewed Skewed
2nd_Road_Number is skewed Skewed
Latitude is skewed Skewed
Location_Northing_OSGR is skewed Skewed
Number_of_Casualties is skewed Skewed
Number_of_Vehicles is skewed Skewed
Accident_Index has a high cardinality: 671340 distinct values High Cardinality
Date has a high cardinality: 2191 distinct values High Cardinality
Local_Authority_(District) has a high cardinality: 422 distinct values High Cardinality
Local_Authority_(Highway) has a high cardinality: 212 distinct values High Cardinality
LSOA_of_Accident_Location has a high cardinality: 34226 distinct values High Cardinality
Police_Force has a high cardinality: 51 distinct values High Cardinality
Time has a high cardinality: 1439 distinct values High Cardinality
Date has constant length 10 Constant Length
Did_Police_Officer_Attend_Scene_of_Accident has constant length 3 Constant Length
LSOA_of_Accident_Location has constant length 9 Constant Length
Pedestrian_Crossing-Human_Control has constant length 3 Constant Length
Pedestrian_Crossing-Physical_Facilities has constant length 3 Constant Length
Speed_limit has constant length 2 Constant Length
Time has constant length 5 Constant Length
Year has constant length 4 Constant Length
Longitude has 914181 (87.18%) negatives Negatives
1st_Road_Number has 286845 (27.36%) zeros Zeros
2nd_Road_Number has 812534 (77.49%) zeros Zeros
  • 1
  • 2
  • 3

Variables


Accident_Index

categorical

Approximate Distinct Count 671340
Approximate Unique (%) 64.0%
Missing 0
Missing (%) 0.0%
Memory Size 77.3 MB

Length

Mean 12.2531
Standard Deviation 1.007
Median 13
Minimum 8
Maximum 13

Sample

1st row 200501BS00001
2nd row 200501BS00002
3rd row 200501BS00003
4th row 200501BS00004
5th row 200501BS00005

Letter

Count 1403984
Lowercase Letter 0
Space Separator 0
Uppercase Letter 1403984
Dash Punctuation 0
Decimal Number 10689021

1st_Road_Class

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 305589
Missing (%) 29.1%
Memory Size 47.1 MB
  • The largest value (A) is over 3.54 times larger than the second largest value (B)

Length

Mean 1.4093
Standard Deviation 1.6292
Median 1
Minimum 1
Maximum 8

Sample

1st row A
2nd row B
3rd row C
4th row A
5th row C

Letter

Count 1041738
Lowercase Letter 296079
Space Separator 0
Uppercase Letter 745659
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (A, B) take over 50.0%
  • The largest value (a) is over 3.54 times larger than the second largest value (b)

1st_Road_Number

numerical

Approximate Distinct Count 6552
Approximate Unique (%) 0.6%
Missing 2
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean 1011.997
Minimum 0
Maximum 9999
Zeros 286845
Zeros (%) 27.4%
Negatives 0
Negatives (%) 0.0%
  • 1st_Road_Number is skewed right (γ1 = 2.0621)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 130
Q3 734.18
95-th Percentile 6004
Maximum 9999
Range 9999
IQR 734.18

Descriptive Statistics

Mean 1011.997
Standard Deviation 1832.0416
Variance 3.3564e+06
Sum 1.0612e+09
Skewness 2.0621
Kurtosis 3.2783
Coefficient of Variation 1.8103
  • 1st_Road_Number is not normally distributed (p-value 1.009259839713505e-23)
  • 1st_Road_Number has 188369 outliers

2nd_Road_Class

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 439824
Missing (%) 41.9%
Memory Size 42.7 MB
  • The largest value (Unclassified) is over 4.02 times larger than the second largest value (A)

Length

Mean 8.512
Standard Deviation 5.0842
Median 12
Minimum 1
Maximum 12

Sample

1st row C
2nd row Unclassified
3rd row B
4th row C
5th row B

Letter

Count 5180111
Lowercase Letter 4570564
Space Separator 0
Uppercase Letter 609547
Dash Punctuation 0
Decimal Number 0
  • The largest value (unclassified) is over 4.02 times larger than the second largest value (a)

2nd_Road_Number

numerical

Approximate Distinct Count 6939
Approximate Unique (%) 0.7%
Missing 10803
Missing (%) 1.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 15.8 MB
Mean 387.0004
Minimum 0
Maximum 9999
Zeros 812534
Zeros (%) 77.5%
Negatives 0
Negatives (%) 0.0%
  • 2nd_Road_Number is skewed right (γ1 = 4.055)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 4058
Maximum 9999
Range 9999
IQR 0

Descriptive Statistics

Mean 387.0004
Standard Deviation 1316.6729
Variance 1.7336e+06
Sum 4.0162e+08
Skewness 4.055
Kurtosis 16.6373
Coefficient of Variation 3.4023
  • 2nd_Road_Number is not normally distributed (p-value 4.719598828188352e-25)
  • 2nd_Road_Number has 225238 outliers

Accident_Severity

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 71.1 MB
  • The largest value (Slight) is over 6.48 times larger than the second largest value (Serious)

Length

Mean 6.118
Standard Deviation 0.3629
Median 6
Minimum 5
Maximum 7

Sample

1st row Serious
2nd row Slight
3rd row Slight
4th row Slight
5th row Slight

Letter

Count 6415142
Lowercase Letter 5366567
Space Separator 0
Uppercase Letter 1048575
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Slight, Serious) take over 50.0%
  • The largest value (slight) is over 6.48 times larger than the second largest value (serious)

Carriageway_Hazards

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 29
Missing (%) 0.0%
Memory Size 69.5 MB
  • The largest value (None) is over 114.63 times larger than the second largest value (Other object on road)

Length

Mean 4.4813
Standard Deviation 3.8798
Median 4
Minimum 4
Maximum 47

Sample

1st row None
2nd row None
3rd row None
4th row None
5th row None

Letter

Count 4608338
Lowercase Letter 3559792
Space Separator 77048
Uppercase Letter 1048546
Dash Punctuation 2257
Decimal Number 0
  • The top 2 categories (None, Other object on road) take over 50.0%
  • The largest value (none) is over 101.67 times larger than the second largest value (road)

Date

categorical

Approximate Distinct Count 2191
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 75.0 MB

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 04/01/2005
2nd row 05/01/2005
3rd row 06/01/2005
4th row 07/01/2005
5th row 10/01/2005

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 8388600
  • Date has words of constant length

Day_of_Week

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 72.2 MB

Length

Mean 7.1693
Standard Deviation 1.1289
Median 7
Minimum 6
Maximum 9

Sample

1st row Tuesday
2nd row Wednesday
3rd row Thursday
4th row Friday
5th row Monday

Letter

Count 7517575
Lowercase Letter 6469000
Space Separator 0
Uppercase Letter 1048575
Dash Punctuation 0
Decimal Number 0

Did_Police_Officer_Attend_Scene_of_Accident

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 269
Missing (%) 0.0%
Memory Size 68.0 MB
  • The largest value (1.0) is over 4.17 times larger than the second largest value (2.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 1.0
2nd row 1.0
3rd row 1.0
4th row 1.0
5th row 1.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2096612
  • The top 2 categories (1.0, 2.0) take over 50.0%
  • The largest value (10) is over 4.17 times larger than the second largest value (20)
  • Did_Police_Officer_Attend_Scene_of_Accident has words of constant length

Junction_Control

categorical

Approximate Distinct Count 7
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90.5 MB

Length

Mean 25.5372
Standard Deviation 4.0073
Median 24
Minimum 9
Maximum 35

Sample

1st row Data missing or ou...
2nd row Auto traffic signa...
3rd row Data missing or ou...
4th row Data missing or ou...
5th row Data missing or ou...

Letter

Count 22663431
Lowercase Letter 21614856
Space Separator 3960445
Uppercase Letter 1048575
Dash Punctuation 0
Decimal Number 153832
  • The top 2 categories (Give way or uncontrolled, Data missing or out of range) take over 50.0%

Junction_Detail

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 90.2 MB

Length

Mean 25.2299
Standard Deviation 9.5925
Median 25
Minimum 9
Maximum 35

Sample

1st row Not at junction or...
2nd row Crossroads
3rd row Not at junction or...
4th row Not at junction or...
5th row Not at junction or...

Letter

Count 21767128
Lowercase Letter 20718553
Space Separator 3772769
Uppercase Letter 1048575
Dash Punctuation 10000
Decimal Number 874409
  • The top 2 categories (Not at junction or within 20 metres, T or staggered junction) take over 50.0%
  • The largest value (junction) is over 1.82 times larger than the second largest value (not)

Latitude

numerical

Approximate Distinct Count 739288
Approximate Unique (%) 70.5%
Missing 111
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean 52.573
Minimum 49.9144
Maximum 60.7575
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Latitude is skewed right (γ1 = 0.9906)

Quantile Statistics

Minimum 49.9144
5-th Percentile 50.8321
Q1 51.4941
Median 52.3757
Q3 53.4736
95-th Percentile 55.8036
Maximum 60.7575
Range 10.8431
IQR 1.9795

Descriptive Statistics

Mean 52.573
Standard Deviation 1.4245
Variance 2.0291
Sum 5.5121e+07
Skewness 0.9906
Kurtosis 0.8686
Coefficient of Variation 0.02709
  • Latitude is not normally distributed (p-value 9.511266623645234e-13)
  • Latitude has 13612 outliers

Light_Conditions

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 2084
Missing (%) 0.2%
Memory Size 76.5 MB
  • The largest value (Daylight) is over 3.67 times larger than the second largest value (Darkness - lights lit)

Length

Mean 11.6289
Standard Deviation 5.997
Median 8
Minimum 8
Maximum 27

Sample

1st row Darkness - lights ...
2nd row Darkness - lights ...
3rd row Darkness - lightin...
4th row Darkness - lights ...
5th row Darkness - lights ...

Letter

Count 11039682
Lowercase Letter 9993191
Space Separator 847365
Uppercase Letter 1046491
Dash Punctuation 282455
Decimal Number 0
  • The top 2 categories (Daylight, Darkness - lights lit) take over 50.0%
  • The largest value (daylight) is over 2.7 times larger than the second largest value (darkness)

Local_Authority_(District)

categorical

Approximate Distinct Count 422
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 75.8 MB
  • The largest value (Birmingham) is over 1.51 times larger than the second largest value (Leeds)

Length

Mean 10.778
Standard Deviation 4.8578
Median 10
Minimum 4
Maximum 28

Sample

1st row Kensington and Che...
2nd row Kensington and Che...
3rd row Kensington and Che...
4th row Kensington and Che...
5th row Kensington and Che...

Letter

Count 10755137
Lowercase Letter 9328588
Space Separator 473379
Uppercase Letter 1426549
Dash Punctuation 33116
Decimal Number 0

Local_Authority_(Highway)

categorical

Approximate Distinct Count 212
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 75.5 MB

Length

Mean 10.512
Standard Deviation 4.6491
Median 10
Minimum 4
Maximum 36

Sample

1st row Kensington and Che...
2nd row Kensington and Che...
3rd row Kensington and Che...
4th row Kensington and Che...
5th row Kensington and Che...

Letter

Count 10625158
Lowercase Letter 9316032
Space Separator 340761
Uppercase Letter 1309126
Dash Punctuation 20246
Decimal Number 0

Location_Easting_OSGR

numerical

Approximate Distinct Count 48520
Approximate Unique (%) 4.6%
Missing 111
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean 438307.695
Minimum 64950
Maximum 655540
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Location_Easting_OSGR is skewed left (γ1 = -0.2946)

Quantile Statistics

Minimum 64950
5-th Percentile 269180
Q1 377590
Median 438771.6
Q3 521840
95-th Percentile 579850
Maximum 655540
Range 590590
IQR 144250

Descriptive Statistics

Mean 438307.695
Standard Deviation 94792.9187
Variance 8.9857e+09
Sum 4.5955e+11
Skewness -0.2946
Kurtosis -0.3785
Coefficient of Variation 0.2163
  • Location_Easting_OSGR is not normally distributed (p-value 0.000981786625304733)
  • Location_Easting_OSGR has 1712 outliers

Location_Northing_OSGR

numerical

Approximate Distinct Count 72413
Approximate Unique (%) 6.9%
Missing 111
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean 298312.9813
Minimum 10520
Maximum 1.2088e+06
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Location_Northing_OSGR is skewed right (γ1 = 1.0)

Quantile Statistics

Minimum 10520
5-th Percentile 105034
Q1 178760
Median 276140
Q3 397560
95-th Percentile 658880
Maximum 1.2088e+06
Range 1.1983e+06
IQR 218800

Descriptive Statistics

Mean 298312.9813
Standard Deviation 158175.6719
Variance 2.502e+10
Sum 3.1277e+11
Skewness 1
Kurtosis 0.9002
Coefficient of Variation 0.5302
  • Location_Northing_OSGR is not normally distributed (p-value 1.9328449883327495e-09)
  • Location_Northing_OSGR has 13988 outliers

Longitude

numerical

Approximate Distinct Count 774515
Approximate Unique (%) 73.9%
Missing 112
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean -1.4545
Minimum -7.5162
Maximum 1.762
Zeros 0
Zeros (%) 0.0%
Negatives 914181
Negatives (%) 87.2%
  • Longitude is skewed left (γ1 = -0.3319)

Quantile Statistics

Minimum -7.5162
5-th Percentile -3.9856
Q1 -2.3373
Median -1.4235
Q3 -0.2316
95-th Percentile 0.5787
Maximum 1.762
Range 9.2782
IQR 2.1057

Descriptive Statistics

Mean -1.4545
Standard Deviation 1.3919
Variance 1.9374
Sum -1.525e+06
Skewness -0.3319
Kurtosis -0.3389
Coefficient of Variation -0.9569
  • Longitude is not normally distributed (p-value 0.0013367011944077488)
  • Longitude has 1386 outliers

LSOA_of_Accident_Location

categorical

Approximate Distinct Count 34226
Approximate Unique (%) 3.5%
Missing 71890
Missing (%) 6.9%
Memory Size 68.9 MB

Length

Mean 9
Standard Deviation 0
Median 9
Minimum 9
Maximum 9

Sample

1st row E01002849
2nd row E01002909
3rd row E01002857
4th row E01002840
5th row E01002863

Letter

Count 976685
Lowercase Letter 0
Space Separator 0
Uppercase Letter 976685
Dash Punctuation 0
Decimal Number 7813480
  • LSOA_of_Accident_Location has words of constant length

Number_of_Casualties

numerical

Approximate Distinct Count 41
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean 1.3599
Minimum 1
Maximum 68
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Number_of_Casualties is skewed right (γ1 = 6.3479)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 1
Q3 1
95-th Percentile 3
Maximum 68
Range 67
IQR 0

Descriptive Statistics

Mean 1.3599
Standard Deviation 0.8224
Variance 0.6763
Sum 1.426e+06
Skewness 6.3479
Kurtosis 184.6862
Coefficient of Variation 0.6047
  • Number_of_Casualties is not normally distributed (p-value 5.007849703722742e-25)
  • Number_of_Casualties has 250804 outliers

Number_of_Vehicles

numerical

Approximate Distinct Count 23
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 16.0 MB
Mean 1.835
Minimum 1
Maximum 32
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Number_of_Vehicles is skewed right (γ1 = 1.7764)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 1
Median 2
Q3 2
95-th Percentile 3
Maximum 32
Range 31
IQR 1

Descriptive Statistics

Mean 1.835
Standard Deviation 0.7181
Variance 0.5157
Sum 1.9242e+06
Skewness 1.7764
Kurtosis 16.5321
Coefficient of Variation 0.3914
  • Number_of_Vehicles is not normally distributed (p-value 7.592397208522272e-21)
  • Number_of_Vehicles has 24296 outliers

Pedestrian_Crossing-Human_Control

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 21
Missing (%) 0.0%
Memory Size 68.0 MB
  • The largest value (0.0) is over 290.12 times larger than the second largest value (2.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 0.0
2nd row 0.0
3rd row 0.0
4th row 0.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2097108
  • The top 2 categories (0.0, 2.0) take over 50.0%
  • The largest value (00) is over 290.12 times larger than the second largest value (20)
  • Pedestrian_Crossing-Human_Control has words of constant length

Pedestrian_Crossing-Physical_Facilities

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 37
Missing (%) 0.0%
Memory Size 68.0 MB
  • The largest value (0.0) is over 13.74 times larger than the second largest value (5.0)

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 1.0
2nd row 5.0
3rd row 0.0
4th row 0.0
5th row 0.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2097076
  • The top 2 categories (0.0, 5.0) take over 50.0%
  • The largest value (00) is over 13.74 times larger than the second largest value (50)
  • Pedestrian_Crossing-Physical_Facilities has words of constant length

Police_Force

categorical

Approximate Distinct Count 51
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 77.5 MB
  • The largest value (Metropolitan Police) is over 3.02 times larger than the second largest value (West Midlands)

Length

Mean 12.4536
Standard Deviation 4.5209
Median 16
Minimum 4
Maximum 21

Sample

1st row Metropolitan Polic...
2nd row Metropolitan Polic...
3rd row Metropolitan Polic...
4th row Metropolitan Polic...
5th row Metropolitan Polic...

Letter

Count 12493429
Lowercase Letter 10955496
Space Separator 555280
Uppercase Letter 1537933
Dash Punctuation 9876
Decimal Number 0

Road_Surface_Conditions

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 1189
Missing (%) 0.1%
Memory Size 70.5 MB
  • The largest value (Dry) is over 2.36 times larger than the second largest value (Wet or damp)

Length

Mean 5.5444
Standard Deviation 3.7748
Median 3
Minimum 3
Maximum 20

Sample

1st row Wet or damp
2nd row Dry
3rd row Dry
4th row Dry
5th row Wet or damp

Letter

Count 5147604
Lowercase Letter 4100218
Space Separator 656588
Uppercase Letter 1047386
Dash Punctuation 0
Decimal Number 1458
  • The top 2 categories (Dry, Wet or damp) take over 50.0%
  • The largest value (dry) is over 2.36 times larger than the second largest value (damp)

Road_Type

categorical

Approximate Distinct Count 5
Approximate Unique (%) 0.0%
Missing 7266
Missing (%) 0.7%
Memory Size 81.4 MB
  • The largest value (Single carriageway) is over 4.92 times larger than the second largest value (Dual carriageway)

Length

Mean 16.9856
Standard Deviation 2.239
Median 18
Minimum 9
Maximum 18

Sample

1st row Single carriageway
2nd row Dual carriageway
3rd row Single carriageway
4th row Single carriageway
5th row Single carriageway

Letter

Count 16692737
Lowercase Letter 15651428
Space Separator 994558
Uppercase Letter 1041309
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Single carriageway, Dual carriageway) take over 50.0%

Special_Conditions_at_Site

categorical

Approximate Distinct Count 10
Approximate Unique (%) 0.0%
Missing 16
Missing (%) 0.0%
Memory Size 69.2 MB
  • The largest value (None) is over 79.35 times larger than the second largest value (Roadworks)

Length

Mean 4.2454
Standard Deviation 2.155
Median 4
Minimum 3
Maximum 42

Sample

1st row None
2nd row None
3rd row None
4th row None
5th row None

Letter

Count 4416981
Lowercase Letter 3368422
Space Separator 32566
Uppercase Letter 1048559
Dash Punctuation 2029
Decimal Number 0
  • The top 2 categories (None, Roadworks) take over 50.0%
  • The largest value (none) is over 79.35 times larger than the second largest value (roadworks)

Speed_limit

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 67.0 MB
  • The largest value (30) is over 3.77 times larger than the second largest value (60)

Length

Mean 2
Standard Deviation 0
Median 2
Minimum 2
Maximum 2

Sample

1st row 30
2nd row 30
3rd row 30
4th row 30
5th row 30

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 2097150
  • The top 2 categories (30, 60) take over 50.0%
  • The largest value (30) is over 3.77 times larger than the second largest value (60)
  • Speed_limit has words of constant length

Time

categorical

Approximate Distinct Count 1439
Approximate Unique (%) 0.1%
Missing 100
Missing (%) 0.0%
Memory Size 70.0 MB

Length

Mean 5
Standard Deviation 0
Median 5
Minimum 5
Maximum 5

Sample

1st row 17:42
2nd row 17:36
3rd row 00:15
4th row 10:35
5th row 21:13

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 4193900
  • Time has words of constant length

Urban_or_Rural_Area

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 85
Missing (%) 0.0%
Memory Size 70.0 MB
  • The largest value (Urban) is over 1.76 times larger than the second largest value (Rural)

Length

Mean 5.0003
Standard Deviation 0.04462
Median 5
Minimum 5
Maximum 11

Sample

1st row Urban
2nd row Urban
3rd row Urban
4th row Urban
5th row Urban

Letter

Count 5242798
Lowercase Letter 4194308
Space Separator 0
Uppercase Letter 1048490
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Urban, Rural) take over 50.0%
  • The largest value (urban) is over 1.76 times larger than the second largest value (rural)

Weather_Conditions

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 21392
Missing (%) 2.0%
Memory Size 81.3 MB
  • The largest value (Fine no high winds) is over 6.62 times larger than the second largest value (Raining no high winds)

Length

Mean 18.0329
Standard Deviation 2.428
Median 18
Minimum 5
Maximum 21

Sample

1st row Raining no high wi...
2nd row Fine no high winds
3rd row Fine no high winds
4th row Fine no high winds
5th row Fine no high winds

Letter

Count 15497532
Lowercase Letter 14470349
Space Separator 2996040
Uppercase Letter 1027183
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Fine no high winds, Raining no high winds) take over 50.0%

Year

categorical

Approximate Distinct Count 6
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 69.0 MB

Length

Mean 4
Standard Deviation 0
Median 4
Minimum 4
Maximum 4

Sample

1st row 2005
2nd row 2005
3rd row 2005
4th row 2005
5th row 2005

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 4194300
  • Year has words of constant length

InScotland

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 43
Missing (%) 0.0%
Memory Size 67.1 MB
  • The largest value (No) is over 15.73 times larger than the second largest value (Yes)

Length

Mean 2.0598
Standard Deviation 0.237
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 2159725
Lowercase Letter 1111193
Space Separator 0
Uppercase Letter 1048532
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The largest value (no) is over 15.73 times larger than the second largest value (yes)

Interactions

Correlations

Missing Values